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1.  Statement  of  Problem  Studied 


This  research  project  focused  on  two  parallel,  complementary  research  activities  designed  to  address 
potential  problems  arising  in  the  transmission  of  multimedia  traffic  to  multiple  parties  over  both  high¬ 
speed  wired  and  lower  speed  wireless  communication  networks  of  the  future. 

The  first  area  was  that  of  providing  appropriate  flow/congestion  control  in  this  multicast  environment. 
Specifically,  we  have  addressed  the  objective  of  designing  good  algorithms  to  control  the  flow  of  traffic 
between  a  source  and  multiple  receives,  either  fixed  or  mobile,  located  at  possibly  diffo-ent  distances 
from  the  source,  so  that  the  receivers  can  each  handle  the  traffic  flow,  and  yet  provide  as  high  a 
throughput  as  possible,  with  queueing  delays  constrained  to  acceptable  values. 

The  second  area  of  research  involved  the  efficient  transmission  of  digital,  standards-based  video  in  this 
networking  environment.  Recognizing  the  highly  variable  nature  of  network  conditions  in  terms  of 
available  bandwidth  and  delay,  we  investigated  novel  algorithms  that  would  combine  rate  shaping  (on- 
the-fly  modification  of  the  bit  rate  of  MPEG-1  and  MPEG-2  video)  and  flow  control  strategies  to 
maximize  the  received  video  quality.  In  doing  so,  we  focused  on  techniques  that  would  make  the  video 
traffic  a  fair  and  equitable  user  of  the  network,  in  the  sense  that  it  would  compete  fairly  for  resources  with 
other  traditional  types  of  traffic  (more  specifically,  the  TCP-based  SMTP,  HTTP  and  FTP  traffic  which 
account  for  most  of  the  traffic  on  the  Internet  today). 

2.  Summary  of  the  Most  Important  Results 

We  summarize  hare  the  results  in  the  two  research  areas  studied. 

2. 1  Multicast  Flow  and  Congestion  Control  Over  Combined  Wired/Wireless 
Networks 

Multipoint  multimedia  communication  is  becoming  more  and  more  important  in  today’s  Internet 
environment  as  well  as  in  more  futuristic  mixed  wired/wireless  network  environments.  The  delivery  of 
multicast  information  over  networks  which  are  designed  primarily  for  point-to-point  communications 
poses  interesting  management  and  control  issues  that  need  to  be  resolved.  Flow  and  congestion  control  is 
one  such  challenging  problem. 

In  this  portion  of  om  work  we  proposed  and  studied  a  unified  framework  for  the  design  and  performance 
analysis  of  multicast  flow  control  algorithms,  as  well  as  some  possible  source  policies,  such  as  listening 
to  the  slowest  request,  random  listening,  weighted  sum,  estimation/prediction,  etc.  Given  the  complexity 
of  the  problem,  we  first  addressed  the  simplest  model  with  two  mobile  receives,  using  as  the  source 
control  simple  binary  on-off  rate  control. 

The  receivers  wore  each  assumed  to  be  connected  by  a  wireless  link  to  a  base  station.  The  wireless  link 
itself  was  modeled  as  a  two-state  Markov  chain,  switching  randomly  from  on  to  off,  to  account  for 
random  fading  and  randomly-occurring  severe  oror  conditions.  The  base  station,  in  turn,  was  modeled  as 
a  queue  driving  the  on-off  link  model.  Simple  messages  were  assumed  to  be  transmitted  from  the  base 
station  back  to  the  source,  dq)ending  on  the  algorithm  proposed,  indicating  the  state  of  the  queue, 
whether  above  or  below  a  specified  threshold;  or  the  actual  queue  size  itself,  as  well  as  rate  of  change  of 
the  queue.  The  base  stations  were  each  located  at  different  distances,  hence  incurring  different 
propagation  delays,  away  from  the  source.  The  receiver  characteristics  were  chosen  to  be  similar  (the 
homogeneous  case)  or  difforent  (the  hetarogeneous  case). 

Three  ad  hoc  algorithms  were  proposed  and  studied  comparatively.  These  included,  first,  the  Listen  to  the 
Slowest  Request  (LSQ)  algorithm,  in  which  each  receiver  sends  a  single-bit  “start-stop”  feedback  signal: 
a  “stop”  signal  whenevo'  the  queue  length  exceeds  a  specified  high  threshold;  a  “start”  signal  whenever 
the  queue  length  exceeds  a  specified  low  threshold.  The  source  stops  sending  traffic  whenever  at  least  one 
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receiver  has  sent  a  “stop”  signal.  It  sends  at  a  constant  rate  otherwise.  The  second,  more  complex, 
algorithm  was  the  Source  Estimation  (SE)  algorithm,  in  which  each  receive'  sends  back  both  its  queue 
length  and  the  queue  growth  rate,  whenever  that  rate  changes  sign.  The  source  then  estimates  the  future 
queue  length  of  each  receiver,  taking  the  propagation  delay  into  account.  It  transmits  at  a  constant  rate 
only  if  at  least  one  estimated  queue  length  drops  below  a  specified  low  threshold  and  all  the  other  queues 
stay  below  specified  high  thresholds.  Otherwise  the  source  stops  transmitting.  The  third  control 
algorithm  used  for  comparison  was  a  simple  open  loop  control,  in  which  the  source  collects  from  each 
receiver  at  call  setup  time  information  as  to  its  location,  its  desired  average  rate  of  recq)tion  of  data 
packets,  its  peak  rate,  and  desired  quality  of  service  requirements,  among  other  parameters.  The  source 
then  adjusts  its  rate  of  transmission  to  satisfy  those  parametCTS,  with  no  signals  fed  back. 

The  three  algorthms  were  compared,  in  the  general  case,  using  simulation.  We  were  able  to  obtain 
analytic  results  in  the  case  of  zero  propagation  delay.  These  results  were  found  to  agree  qualitatively  with 
the  simulation  results  with  propagation  delay  included.  The  results  indicated  that  the  more  complex  SE 
algorithm  was  able  to  provide  bettor  delay-throughput  performance  than  the  LSQ  algorithm.  However,  as 
the  propagation  delays  increased  the  relative  parformance  improvement  was  reduced,  since  prediction  of 
queue  lengths  was  increasingly  less  accurate.  For  rapid  link  fading  rates  the  open-loop  algorithm  becomes 
a  good  policy  to  use,  with  the  feedback  signals  providing  relatively  little  added  information.  Various 
aspects  of  this  work  have  been  presented  at  conferences  and  published  in  technical  journals.  Details 
appear  in  references  [1],  [2],  [3],  [4]  in  the  list  of  publications  that  follows. 

Afro:  studying  the  two-receiver  system  with  binary  on-off  control,  we  went  on  to  study  more  complex 
rate-based  flow  control  algorithms.  The  ATM  Forum  has  selected  a  rate-based  flow  control  mechanism  to 
support  its  projected  ABR  savice  in  the  point-to-point,  single  receive:  case,  so  it  is  of  considerable 
interest  to  study  multicast  rate-based  flow  control  in  a  more  general  case  than  the  on-off  control  case.  We 
focused  in  this  work  on  the  fixed  receiver,  high-speed  wired  environment  only, 

Specifically,  we  studied  a  “weighted  sum”  (WS)  algorithm,  which  appears  suited  to  the  multicast 
environment,  and  analytically  proved  it  to  be  stable  under  proper  parameter  settings.  This  algorithm 
extends  earlier  work  on  the  stability  of  a  unicast  feedback  flow  control  algorithm  carried  out  at  Bell  Labs. 
In  this  weighted  sum  extension  of  the  unicast  algorithm,  the  source  picks  an  appropriate  gain  factor  with 
which  to  weight  the  (propagation-delayed)  queue  size  information  received  from  each  receiver  involved 
in  the  multicast  tree  and  sums  the  weighted  information  in  determining  the  appropriate  transmission  rate 
to  use.  An  additional  damping  coefficient  is  added  to  ensure  stability.  A  detailed  analysis  was  carried  out 
of  the  two-receiver  case,  using  a  fluid-flow  approach  to  model  the  resultant  system.  The  model  results  in  a 
second-order  detCTministic  delay-differential  equation  to  be  solved.  In  gena’al,  this  is  a  very  difficult 
problem  to  attack.  We  ware  able,  however,  to  use  a  stability  approach  technique  developed  by  a 
researcher  at  NEC  C&C  Labs,  Princeton,  to  carry  out  the  stability  analysis  in  this  case,  and  come  up  with 
rules  for  determining  the  optimal  parameto'  settings  to  not  only  ensure  a  stable  systgem,  but  to  provide 
the  fastest,  oscillation-free  transient  behavior.  Details  of  this  work  appear  in  [1]  and  [5]. 

Tlie  third  and  final  portion  of  this  work  on  multicast  flow  control  involved  window-based  flow  control  in 
the  Internet  enviromnent.  The  objective  was  to  come  up  with  a  TCP-like  multicast  flow  control  algorithm 
that  retained  as  many  of  the  TCP  unicast  flow  control  features  as  possible,  including  being  responsive  to 
congestion,  yet  was  fair  in  allocating  bandwidth  to  all  users  using  the  same  network  links.  The  fairness 
issue  had  two  components:  the  multicast  traffic  had  to  be  fair  in  sharing  bandwidth  with  TCP  along  the 
same  paths;  multiple  multicast  sessions  between  the  same  sender  and  receivers  had  to  share  the  bandwidth 
equally,  on  average. 

To  accomplish  these  goals,  we  came  up  with  a  congestion-control  algorithm  we  call  the  “Random 
Listening  Algorithm”,  which  cuts  the  window  at  the  source  on  a  random  basis  when  receiving  notification 
of  congestion  from  any  receiver.  The  rationale  here  was  two-fold:  first,  occasional  packet  losses  do  not 
necessarily  mean  congestion  is  present;  second,  given  a  congested  (“bottleneck”)  link  along  a  source- 
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receiver  path,  multiple  congestion  messages  might  be  received  from  receivers  affected.  One  would  like 
the  source  to  respond  only  once  in  this  case,  not  to  each  such  message  received.  The  algorithm  was  thus 
designed  to  only  respond  to  receivers  reporting  frequent  losses.  On  receiving  a  congestion  message  from 
such  a  receiver,  the  sender  reduces  its  window  with  a  probability  1/n,  where  n  is  the  number  of  such 
receives.  Details  of  the  algorithm  appear  in  [1]  and  [6]. 

Both  analysis  and  simulation  were  used  to  show  the  algorithm  was  stable  and  fair.  To  prove  the  latter  we 
had  to  define  a  quantitative  measure  of  fairness  we  call  “essentially  fair”,  which  provides  a  lower  and 
upper  bound  on  the  multicast  throughput  with  respect  to  any  TCP  coimections  using  the  same  source- 
receiver  paths.  We  then  proved  fairness  of  the  algorithm  along  a  restricted  tree-based  topology  for  various 
congestion  cases,  for  both  drop-tail  and  RED  gateways  along  the  path.  (Drop-tail  gateways  are 
extensively  used  in  the  current  Intanet;  RED  gateways  have  been  proposed  to  reduce  network  queueing 
delay  and  to  provide  a  faira  response  to  congestion.)  Extensive  simulations  were  carried  out  to  both 
verify  our  analysis  and  to  validate  the  effectiveness  of  the  random  listening  algorithm. 

In  addition  to  the  work  described  above  on  multicast  flow  control  for  wired  and  combined  wired/wireless 
networks,  work  was  carried  out  on  multimedia  admission  control  in  cellular  mobile  wireless  networks.  A 
distributed  predictive  admission  control  algorithm  was  developed  that  provides  appropriate  throughput  to 
the  system  while  maintaining  a  fixed  call  dropping  probability  for  each  traffic  class,  as  defined  by  the 
user.  The  algorithm  is  based  on  simple  Markov  analysis  and  traffic  prediction.  It  was  shown,  by 
simulation,  to  compare  favorably  with  otha  admission  control  algortihms  proposed  in  the  Itoature.  This 
work  was  presented  at  an  international  Workshop  on  digital  communications  and  has  been  published,  in 
book  form,  as  part  of  the  Workshop  Proceedings  [7]. 

2.2  Digital  Video  and  Audio  Processing  for  Multipoint,  Multimedia 
Communications 

In  this  portion  of  our  work  we  focused  on  techniques  that  would  adapt  the  rate  of  compressed  digital 
video  and  audio  information  in  order  to  accommodate  the  dynamic  nature  of  available  bandwidth  and 
delay  in  both  single  and  multi-point  communications.  Our  approach  consisted  of  combining  signal 
processing  principles  and  flow  control  strategies  in  order  to  maximize  the  perceived  quality  of  the 
information  received  by  the  users  while  operating  within  realistic  network  performance  parameters.  In 
addition,  we  focused  on  well-known  audio  and  video  coding  techniques,  more  specifically  MPEG-1  and 
MPEG-2. 

The  problem  was  decomposed  into  two  parts.  First,  a  method  had  to  he  developed  to  allow  the 
modification  of  the  bit  rate  of  compressed  digital  audio  and  video  to  arbitrary  and  dynamically  varying 
constraints.  Several  techniques  have  been  rq)orted  in  the  literature  in  the  context  of  traditional  rate 
control,  including  some  that  take  into  account  network  conditions  (e.g.,  by  Reibman  et  al.  in  the  former 
AT&T  Bell  Labs).  These,  howevo:,  require  that  the  encoder  is  available  and  is  tightly  coupled  with  the 
network.  This  precludes  application  of  these  techniques  on  precompressed,  stored  content.  Further,  it 
requires  substantial  modification  on  the  encoding  algorithm,  something  not  easily  done  with  high-quality 
hardware-based  encodes. 

To  mitigate  these  problems,  we  used  a  refined  version  of  our  dynamic  rate  shaping  technique,  which 
provides  for  arbitrary  bit  rate  modification  of  compressed  video  according  to  user-imposed  constraints. 
Rate  shaping  works  on  any  block-based  transform  codec  (including  MPEG-l/MPEG-2,  H.261,  H.263,  M- 
IPEG).  Its  principle  lies  in  the  optimal  removal  of  transform  (typically.  Discrete  Cosine  Transform  - 
DCT)  coefficients  from  a  compressed  bitstream  prior  to  its  transmission  on  the  network.  Optimallity  hare 
is  in  the  context  of  minimizing  the  unavoidable  distortion  for  the  desired  target  bit  rate.  We  have 
developed  the  mathematical  framework  for  the  analysis  of  this  problem,  and  drived  optimal  algorithms 
as  well  as  fast  approximations  that  are  within  0.5  dB  of  the  optimal  one.  A  particularly  attractive  feature 
of  the  fast  approximations  are  that  they  are  still  near-optimal  even  if  error  propagation  is  ignored.  This 
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propagation  is  gena-ated  by  the  recursive  nature  of  video  coding  (i.e.,  coding  of  P  and  B  frames,  in  MPEG 
parlance).  The  ova:alI  complexity  of  these  algorithms  is  much  less  than  a  fulll  decoder,  thus  making  them 
suitable  for  software-only  implementations.  The  achieved  bit  rate  reduction  d^ends  on  the  original 
resolution  and  bit  rate  of  the  video  signal,  and  can  be  as  much  as  50%  or  more. 

Our  prior  work  in  this  area  only  considered  the  mathematical  problem,  and  only  fixed  target  bit  rates.  In 
OTder  to  apply  this  technique  to  a  dynamic  network  environment,  it  was  essential  to  extend  it  to  handle 
time- varying  constraints.  A  particularly  challenging  problem  is  the  origin  of  these  constraints:  in  orda:  for 
them  to  be  useful,  they  must  faithfully  represent  the  actual  available  bandwidth  on  the  given  network.  Our 
efforts  concentrated  on  the  Intonet,  as  a  very  relevant  example  of  a  dynamically- varying  network.  While 
there  are  several  techniques  for  estimating  network  bandwidth,  an  additional  desirable  characteristic  that 
we  identified  was  fairness.  In  other  words,  we  would  want  to  ensure  that  the  video  traffic  would  behave 
inside  the  network  in  exactly  the  same  way  as  traditional  data  traffic,  but  without  ignoring  video’s  special 
characteristics  (tolerance  for  losses,  need  for  low  delay  and  jitter).  Considering  that  the  vast  majority  of 
the  Internet  traffic  today  is  based  on  TCP  (which  is  the  transport  protocol  used  in  HTTP,  FTP,  and 
SMTP),  we  decided  to  use  TCP’s  window-based  flow  control  but  without  error  control.  The  benefit  of 
this  approach  is  that  the  injected  traffic  behaves  as  any  oth^  TCP-based  traffic,  while  the  delays  incurred 
by  the  retransmissions  imposed  by  error  control  are  eliminated.  By  definition,  this  appoach  guarantees 
fairness,  something  that  is  very  hard  to  ascertain  for  any  other  scheme  due  to  the  non-linearity  and 
complexity  of  the  TCP  algorithms. 

This  window-based  flow  control,  however,  cannot  be  directly  applied  to  the  rate  shapa:.  The  latter 
requires  an  actual  bit  budget  in  ordo"  to  modify  the  number  of  bits  to  be  used  on  each  fi’ame.  We  devised  a 
solution  that  uses  the  traditional  buffer-constrained  rate  control  model  used  in  video  and  audio  coding 
systms.  A  buffer  is  placed  between  the  rate  shapa*  and  the  TCP  flow  control  algorithm;  the  rate  shapa: 
modifies  the  target  bit  rate  (and  thus,  bit  budget  per  frame)  dq)ending  on  the  occupancy  of  the  buffer.  The 
buffer,  in  turn,  is  emptied  at  the  rate  specified  by  the  flow  control  algorithm.  The  benefit  of  this  technique 
is  that  it  can  provide  for  very  stable  control  of  the  bit  rate  while  avoiding  rapid  fluctuations  and  the 
corresponding  undesirable  effects  on  video  quality. 

A  complete  system  was  implemented  in  software  using  a  Sun  workstation  as  a  servo-  and  Windows  95  as 
the  client.  The  server  contained  an  optimized  implementation  of  a  rate  shaper  as  well  as  an 
implementation  of  the  TCP  flow  control  algorithm  in  usar-mode  (i.e.,  outside  the  OS  ko-nel)  operating 
over  UDP.  On  the  client  side,  software-based  video  decoding  was  used  based  on  Microsoft’s 
ActiveMovie  environment  (renamed  to  DirectShow).  In  particular,  an  ActiveMovie  filter  was  developed 
to  implement  the  client-side  of  the  flow  control  algorithm.  The  system  is  capable  of  operating  in  real¬ 
time,  providing  video  rates  up  to  30  frames  per  second  for  frame  sizes  of  160x120.  Extensive  tests  were 
performed  both  on  our  local  area  network,  and  also  aaoss  the  actual  Internet  in  connections  extending 
ovCT  30  hops.  As  a  result  of  these  tests,  some  additional  features  were  added,  such  as  selective 
rentrasmission  when  the  end-to-end  delay  allows  it  (to  increase  robustness).  Subjective  comparison  with 
several  commercial  solutions  (e.g.,  by  VDOnet)  demonstrated  that  oiu"  technique  outpoTormed  them 
(better  quality,  higher  frame  rate).  A  particularly  important  feature  is  that  the  rate  adaptation  allows  much 
higher  frame  rates  to  be  used,  thus  substantially  improving  the  user’s  expa-ience.  Results  of  this  work 
have  been  reported  in  numa-ous  journal  publications,  confa-ence  presentations,  and  demonstrations  (see 
[9],  [10],  [13]-[17].  In  addition,  a  US  patent  has  been  filed  (see  Section  5  below). 

We  also  examined  the  use  of  such  a  system  over  wireless  networks.  The  fundamental  difference  lies  in 
TCP’s  behavior  over  wireless  links.  TCP  paformance  ova  wireless  networks  can  be  poor  because  TCP 
assumes  losses  occur  in  the  network  only  when  there  is  congestion.  However,  wireless  networks  often 
drop  packets  due  to  random  errors  on  the  wireless  link.  TCP  performance  can  be  sevaely  degraded  when 
packets  with  errors  are  mistaken  for  congestion.  We  devised  a  very  simple,  tractable,  and  closed-form 
analytical  model  for  determining  the  paformance  of  TCP  ova  hybrid  wired/wireless  networks.  We 
compared  our  model  with  previous  work  in  this  area  from  an  analytical  paspective.  We  also  compared 
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our  model  with  several  simulations.  The  results  showed  that  our  model  matches  both  the  previous  work 
and  the  simulations  very  well.  We  also  provided  a  simple  heuristic  for  determining  when  wireless  errors 
will  severely  degrade  TCP  performance.  These  results  are  r^rted  in  [11]. 

In  the  course  of  this  investigation,  we  also  had  to  address  video  servCT  scalability  issues.  Since  rate 
shaping  requires  computational  resources  from  the  saver,  it  has  a  non-trivial  impact  on  the  number  of 
streams  it  can  simultaneously  serve.  We  investigated  the  paformance  aspects  of  high-performance  video 
pumps  in  our  Video-on-Demand  testbed,  and  identified  the  PDU  (protocol  data  unit)  size  as  one  of  the 
most  significant  problems  in  server  scalabiUty.  In  particular,  we  demonstrated  that  a  smaller  PDU  size 
(mandated  by  buffering  resctrictions  on  simple  receivas  such  as  set-top  boxes)  has  a  devastating  impact 
on  pumps  based  on  genaal-purpose  workstations  (even  in  high-end  systems  such  as  the  Silicon  Graphics 
Onyx/Challenge).  Our  results  are  reported  in  [12]  and  [17]. 

Our  experiments  with  the  above  techniques  have  concentrated  almost  exclusively  on  video.  However,  the 
same  principles  apply  equally  well  on  audio.  For  the  case  of  speech  signals,  there  are  additional 
considerations  that  simplify  its  treatment  in  a  multimedia  networking  environment.  In  particular,  the  fact 
that  silence  detection  can  be  used  prior  to  transmission  to  eliminate  silent  signal  parts  can  be  of  great  use 
to  mitigate  delay  and  bandwidth  variations  on  a  network.  This  is  similar  in  spirit  to  rate  shaping,  but  the 
rate  adaption  utilizes  paceptual  criteria  rather  than  being  applied  to  the  entire  signal.  We  developed  a 
novel  silence  detection  scheme  for  the  explicit  use  in  a  genaal-purpose  computing  environment.  By 
integrating  in  our  Xphone  videoconferencing  system  (see  A.  Eleftheriadis  et  al.,  “Architecture  and 
Algorithms  of  the  Xphone  Multimedia  Communication  System,”  ACM  Multimedia  Systems  Journal,  Vol. 

2,  No.  2,  August  1994,  pp.  89-100),  we  demonstrated  that  it  provides  an  excellent  tradeoff  between 
computational  simplicity  and  accuracy.  Detailed  results  and  comparisons  with  various  state-of-the-art 
schemes  are  presented  in  [8]. 
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